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THE  INVERSE  SCATTERING  AND  TARGET  IDENTIFICATION  PROBLEM 
K.J.  Breeding  and  A. A.  Ksienski 


I.  INTRODUCTION 

In  order  to  obtain  an  exact  solution  to  the  inverse  scattering 
problem,  the  target  response  is  required  over  a  continuous  band  of 
frequencies  and  aspect  angles. [1]  An  alternative  approach  to  the  problem 
which  requires  a  much  more  modest  amount  of  measurement  data  but  assumes 
substantial  a  priori  information  is  as  follows:  A  finite  set  of 
alternatives  is  specified  regarding  target  shape  and  composition  and  a 
set  of  measurements  is  carried  out  to  provide  the  answer  as  to  which  one 
of  the  alternatives  holds.  Since  it  is  necessary  to  restrict  the  measure¬ 
ments  to  a  relatively  small  range  of  frequencies  the  question  -rises  as  to 
what  frequency  band  is  best  to  characterize  the  target  and  to  provide  the 
most  reliable  discrimination  from  other  targets.  The  best  apparent  choice 
is  the  highest  frequency  possible  since  for  a  given  per  cent  bandwidth  it 
would  contribute  the  greatest  amount  of  information.  Also,  the  high 
frequencies  provide  the  resolution  which  yield  fine  detail.  The  high 
frequency  approach  has  been  the  most  widely  accepted  approach  for  obtaining 
target  signatures.  However,  an  examination  of  the  Fourier  transform  of 
the  impulse  response  indicated  that  the  frequency  range  corresponding  to 
wavelengths  starting  with  the  size  of  the  object  and  increasing  to  ten 
times  its  dimension  would  provide  the  most  useful  initial  information. [2, 3] 
At  higher  frequencies  the  responses  of  various  objects  appear  to  be  less 
typical  of  any  specific  object.  This  explains  the  failure  of  the  "target 
signature"  approach  where  the  scattering  cross  sections  were  examined 
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mostly  at  high  frequencies.  This  does  not  imply  that  the  upper  end  of 
the  spectrum  is  not  useful  for  the  characterization  of  objects,  however 
it  must  be  utilized  in  conjunction  with  the  lower  frequency  response 
to  provide  a  meaningful  description.  The  higher  frequencies  characterize 
the  finer  detail  of  the  cbject  while  the  lower  frequencies  provide  the 
gross  detail  such  as  overall  dimensions,  approximate  shape,  and  material. 
Obviously  presenting  only  fine  detail  yields  a  rather  confusing  picture. 

It  is  the  low  frequency  range  which  is  used  in  this  paper  to 
characterize  the  various  objects  of  interest,  and  provides  the  infor¬ 
mation  used  to  identify  the  objects,  i.e.,  to  choose  the  alternative 
which  is  most  likely  to  be  true.  The  term  "likelihood"  rather  than 
"certainty"  is  used  because  of  the  incomplete  knowledge  of  the  target 
response  as  represented  by  a  limited  frequency  and  aspect  angle  sampling 
and,  of  course,  due  to  noise  and  measurement  errors.  To  determine 
whether  the  various  targets  of  interest  can  be  reliably  identified  based 
on  the  proposed  low  frequency  characterization  it  must  be  shown  that 
each  target  is  represented  in  nonoverlapping  regions  of  an  appropriate 
space,  such  that  when  a  certain  set  of  radar  measurements  have  been 
made  a  decision  could  be  reached  with  regard  to  the  target  and  that 
such  decision  would  have  a  high  probability  of  being  correct. 

The  specific  set  of  frequencies  chosen  to  illuminate  the  target 
are  12  harmonically  related  frequencies  in  the  Rayleigh  anu  first 
resonance  region.  The  range  of  aspect  angles  has  been  as  complete 
as  possible  to  provide  the  most  comprehensive  representation  of  the 
object.  The  radar  returns  provide  both  phase  and  amplitude  information 
which  would  yield  a  24  dimensional  vector  which  could  then  represent 
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each  aspect  angle  of  the  object.  The  set  of  points,  in  such  a  24 
dimensional  space,  corresponding  to  the  various  aspect  angles  would 
then  delineate  the  region  of  the  space  which  would  characterize  the 
particular  object  and  hopefully  would  do  it  uniquely,  i.e.,  without 
overlapping  a  region  corresponding  to  another  object.  Although,  as 
mentioned,  the  phase  information  is  available,  it  was  decided  to  utilize 
only  the  amplitudes  of  the  radar  returns.  The  reason  for  this  decision 
is  that  to  date  no  satisfactory  method  has  been  found  to  remove  the 
sensitivity  of  the  phase  data  to  renge,  and  since  the  fundamental 
properties  of  class  separability  was  investigated  data  with  large  potential 
for  errors  were  considered  undesirable. 

Thus,  each  object  was  represented  by  a  set  of  real  12-tuples 
x  =  (x-j,  Xg,  •••,  x-|2)  aach  element  of  which  corresponded  to  the  magnitude 
of  the  return  signal  at  the  frequency  30i  MHz  (where  i  =  1 ,  •••  12). 

There  is,  of  course,  one  such  vector  for  each  aspect  angle  associated  with 
the  objects  in  question. 

Various  aspects  of  target  classification  utilizing  the  *bove 
spectral  representation  have  been  studied.  In  Section  II  th.  linear 
separability  of  the  various  classes  is  considered.  Section  III  presents 
a  specific  approach  for  the  design  of  an  automatic  target  classifier  and 
its  performance  is  tested  for  various  objects  in  the  presence  of  noise  and 
errors.  Section  IV  discusses  adaptive  procedure  for  target  classification. 
Section  V  presents  a  method  for  identification  of  target  dynamics  and, 
finally.  Section  VI  considers  the  use  of  nonlinear  decision  surfaces  for 
the  target  classification. 
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II.  LINEAR  SEPARABILITY  OF  OBJECT  CLASSES 

Since  the  data  is  represented  as  a  set  of  real  12-tuples  for  each 
object,  the  possibility  exists  of  attempting  a  linear  separation  of 
classes  in  12  space.  In  general  such  linear  separation  may  be  des¬ 
cribed  as  follows.  Let  S  ~  {op  a2,  ••*,  ar)  be  a  set  of  r  objects  to 
be  classified.  Associated  with  each  object  a^eS  is  a  set  of  n- tuples 
(in  our  case,  12-tuples),  A av  Without  loss  of  generality,  define  the 
sets 


(1) 


A  =  Aa 
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and 


(2) 


8  *  U  Aa.,  2  <  p  <  r. 
j=2  J 


The  sets  A  and  B  are  said  to  be  linearly  separable  if,  and  only  if, 
there  exists  a  hyperplane,  H,  characterized  by  the  separating  vector, 
[u-j ,  o>2,  •••»  T]  =  [o;  T]  such  that 


(3) 


Si) 


(j) 


«  •  a  >  T,  for  all  a  e  A 


and 


(*} 


+  Jj)  *(i) 

u  •  £  <  T,  for  all  b  e  B, 


where  u  •  x  denotes  the  inner  product  of  vectors  u  and  x. 


!  . 


I 


?  : 
I  ! 


U 


u 


4 


In  general  measurement  errors  and  other  forms  of  noise  may  be 
•introduced  into  the  data.  Since  a  solution  to  (3)  and  (4)  consists 
of  determining  a  real  vector  tS  and  a  real  number  T  such  that  these 
inequalities  are  satisfied,  any  small  error  in  new  data  may  place  the 
resulting  point  on  the  wrong  side  of  the  hyperplane.  To  reduce  the 
effects  of  minor  variations  in  data  due  to  noise  or  measurement  errors, 
it  is  desirable  to  have  no  points  in  the  original  training  sets,  A  and 
B,  fall  on  the  hyperplane.  This  objective  may  formally  be  handled  by 
reformulating  the  original  problem  as 


(5) 


Ji)  Jj) 

a  >  T  +  6,  for  all  a  e  A 


and 

(6) 


+  *  Jj) 

w  •  S  <  T  -  6 ,  for  all  ft  e  B 


where  <5  is  an  arbitrarily  small  positive  real  number.  These  inequali¬ 
ties  may  be  normalized  by  dividing  through  by  &  and  replacing  a>  by  «/<5 
and  T  by  T/6.  In  so  doing  the  initial  problem  of  determining  the 
separating  vector  [u;  T]  from  the  original  training  sets  may  be  reformu¬ 
lated  as 


(7) 
and 

(8) 


-> 

0) 


(j) 


Jj) 


't  • 


>  T  +  1 ,  for  all  a 


JLS)  Jtt) 

>  <  T  -  1 ,  for  all  b  e  B. 
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Problems  of  the  type  given  by  inequalities  (7)  and  (8)  will,  of 
course,  have  an  infinity  of  solutions  if  one  exists  at  all.  There  are 
many  ways  of  solving  problems  formulated  as  in  (7)  and  (8).  One 
particularly  appropriate  procedure  which  determines  a  specified  solution, 
assuming  one  exists,  is  that  of  linear  programming. [4]  In  this  approach 
„  inequalities  (7)  and  (8)  are  solved  subject  to  the  condition  that  some 
linear  form  is  to  be  minimized.  It  has  been  shown[5]  that  the  relia* 
bility  of  a  threshold  device  used  to  realize  a  linear  classifier  is 
increased  if  the  value  of  T  in  (7)  ar.d  (8)  is  minimized.  Thus,  the 
objective  function  of  minimizing  T  subject  to  the  constraints  im¬ 
posed  by  (7)  and  (8)  forms  the  required  linear  programming  problem. 

Results 

Referring  to  the  original  backscatter  data  it  will  be  observed 
that  this  original  data,  which  forms  the  training  sets,  consisted  of 
10  points  representing  the  prolate  spheroid  (P.S.),  10  points  for  the 
cube  (c),  5  points  for  the  hemispherical  boss  (MSB),  3  sphere  points 
(S),  and  2  points  for  the  wire.  To  investigate  the  possibility  of 
linear  separability,  hyperplanes  were  found  based  on  the  formulation 
in  (7)  and  (8)  which  separated  each  object  from  all  the  others.  The 
somewhat  unexpected  result  was  that  this  was  possible.  The  resulting 
hyperplane  direction  numbers  are  shown  in  Table  I.  Two  rather  inter¬ 
esting  observations  may  be  made.  First,  note  that  in  each  case  T  =  0. 
This,  of  course,  means  that  the  respective  separating  hyperplane  passes 
through  the  origin.  The  physical  reason  for  this  is  not  at  present  clear. 
The  second  observation  is  that  =  0  in  all  cases.  It  has  been  conject¬ 

ured  [2]  that  this  component,  which  corresponds  to  the  first  resonant 
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peak,  is  in  some  way  indicative  of  the  overall  size  of  the  target. 

Since  each  of  the  objects  considered  were  roughly  of  the  same  size,  this 
first  resonant  peak  would  then  appear  at  about  the  same  frequency  in  each 
case.  Thus,  this  component  can  play  no  part  in  the  overall  separation. 

Since  each  of  the  objects  is  separable  from  all  others  taken  to¬ 
gether,  they  must  be  pairwise  separable.  These  pairwise  separating 
hyperplanes  were  next  calculated  and  their  parameters  are  given  in 
Table  II.  It  should  be  observed  once  again  that  T  =  0,  =  0,  as  would 

be  expected  from  the  results  shown  in  Table  I.  These  pairwise  separating 
hyperplanes  are  used  later  as  a  basis  for  an  automatic  classification 
procedure.  One  further  comment  should  be  made  here.  Each  occurrence  of  a 
zero  in  Tables  I  and  II  implies  that  there  exists  some  characteristic 
similarity  between  the  classes  of  objects  in  question  and  indicates  that 
other  dimensions  are  more  effective  in  the  identification  of  the  object. 
Exactly  what  these  similarities  may  be  due  to  is  not,  at  this  time,  known. 

Since  only  a  relatively  few  number  of  points  were  used  to  de¬ 
termine  the  separating  hyperplanes,  it  is  fair  to  wonder  whether 
separation  is  maintained  for  data  obtained  at  different  aspect  angles 
than  used  initially.  To  this  end,  the  data  for  the  cube,  hemi -spherical 
boss,  and  prolate  spheroid  were  interpolated  to  produce  data  at  1° 
increments  in  aspect  angle.  The  interpolation  procedure  used  is  dis¬ 
cussed  in  detail  in  Appendix  A.  These  interpolated  data  were  then  com¬ 
pared  with  each  of  the  hyperplanes  of  Table  II  with  the  result  that  the 
separation  specified  by  inequalities  (7)  and  (8)  was  maintained.  This 
fact  indicates  that  the  data  representations  taken  for  the  objects  is 
apparently  quite  indicative  of  the  objects  and  further  that  these 
points  must  strongly  cluster  in  the  Euclidian  12  space,  E-^. 
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As  was  mentioned  earlier,  the  inequalities  (3)  and  (4)  were  modi¬ 
fied  to  inequalities  (7)  and  (3}  as  an  attempt  to  reduce  the  effect  of 
noise  on  the  identification  of  the  objects-  To  examine  the  consequences 
of  this  modification,  two  experiments  were  performed  on  the  data  for  the 
prolate  spheroid,  the  three  spheres,  and  five  of  the  cube  data  points. 

In  the  first  experiment  uniformly  distributed  noise  bounded  in  absolute 
value  by  a  constant,  a,  was  added  to  the  elements  of  the  cube  and  sphere 
backscatter  data.  New  separating  hyperplanes  were  then  computed  based 
on  inequalities  (7)  and  (8).  The  results  of  these  experiments  are  given 
in  Table  III  where  the  signal  to  noise  ratio,  S/N,  is  based  on  the  average 
return  signal  amplitude  and  the  average  of  the  absolute  value  of  the 
applied  noise.  The  first  observation  to  be  made  from  this  data  is  that 
the  general  topology  of  the  points  in  E.j2  are  not  appreciably  affected  by 
the  addition  of  noise  whose  average  level  is  significantly  below  that  of 
the  signal.  This  is  indicated  by  the  fact  that  the  resulting  hyperplanes 
are  only  slightly  varied  from  their  original  positions  up  to  an  S/N  of 
19  dB.  However,  when  the  S/N  becomes  -  IdB  the  situation  changes  con¬ 
siderably.  First,  interestingly  enough,  the  sets  of  points  are  linearly 
separable  but  the  separating  hyperplanes  orientation  has  been  considerably 
altered  from  the  no  noise  situation.  These  results  thus,  indirectly 
reinforce  the  choice  for  inequalities  (7)  and  (8). 

In  the  second  experiment  noise  in  the  same  steps  as  for  Table  III 
was  added  to  the  sphere,  cube,  and  prolate  spheroid  data.  The  relative 
distance,  u  •  x.  from  the  sphere/cube  hyperplane  was  then  computed. 

These  computations  are  based,  effectively,  on  inequalities  (3)  and  (4). 
This  data  is  shown  in  Table  IV.  In  this  table  a  positive  number  means 


10 


TABLE  ill 

SPHERE/CUBE  HYPERPLANES  WITH  NOISE  ADDED  TO  RETURN  SIGNAL 

S/N  36  dB  19  dB -1  dB 19  dB 


No  *01  >_|noi  se  |  .l>Jnoise|  1>Jnoise|  .  1  >J  noise  | 

Noise  S.only  S.only  S.only  C  S 


“1 

0 

0 

0 

0 

0 

u>2 

0 

0 

0 

0 

0 

“3 

.5 

.41 

.27 

0 

.26 

“4 

0 

0 

0 

.06 

0 

“5 

0 

0 

0 

1.46 

0 

“6 

4.72 

4.84 

7.21 

0 

6.99 

! 

“7 

0 

0 

0 

-.57 

0 

“8 

-3.35 

-3.28 

-3.94 

-1.46 

-4.27 

! 

“9 

1.32 

1.26 

1.26 

1.04 

1.94 

“10 

0 

0 

0 

0 

o 

“11 

0 

0 

0 

0 

0 

“12 

0 

C 

0 

0 

0 

T 

0 

0 

0 

0 

0 

PS10 


2.57 


2.65 


2.38 


3.51 


the  data  point  is  above  the  hyperplane.“  Notice  from  this  table,  that  no 
points  are  misclassified  until  the  S/N  becomes  about  -  1  dB.  This  fact 
confirms  the  observations  made  from  the  data  of  Table  III  and  justifies 
the  decision  to  determine  the  separatin':  hyperplanes  from  the  training 
data  based  on  inequalities  (7)  and  (8).  Notice  one  further  thing  in 
Table  IV,  that  the  sphere  and  the  prolate  spheroid  data  are  located  on 
the  same  side  of  the  sphere/cube  hyperplane.  This  result  might  have  been 
anticipated  from  the  fact  that  these  two  objects,  spheres  and  prolate 
spheroids,  are  geometrically  quite  similar. 

III.  AUTOMATIC  TARGET  CLASSIFICATION  BASED 
ON  SPECTRAL  RESPONSE 

The  result  that  the  classes  of  objects  studied  above  are  linearly 
separable  suggests  a  possible  scheme  for  automatically  identifying 
targets.  Let 

S  =  Ia-| ,  ••*,  or> 

be  the  set  of  r  object  classes  with  each  object  represented  by  the 
training  set 

LO.l)  *(1.2)  JW' 

Ao,  =  Ha  ,  a  ,  a  >,  l  *  1 ,  2,  •  •  • ,  r 
1  1 

of  n-tuples.  In  accordance  with  the  results  of  Section  II  assume  that 
every  pair  of  object  sets,  Ac.  and  A a-,  are  linearly  separable  by  the 

*  J 

hyperplane  H.  ..  Each  such  hy pi -plane  is  represented  by  a  separating 
» 

vector  •  T-  .]  such  that 

JUS)  Ji.k)  Ji'.k) 

(9)  u  •  a  —  ^i j  +  ^  *or  a^  a  £ 
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and 


00) 


JiJ) 

to  •  a 


iTij  - 1 for  al!  s 


(j>&) 


e  Ao. 
J 


Thus,  inequalities  (9)  and  00)  determine  a  separating  hyperplane  for 
any  pair  of  training  sets  Aa^  and  A a^.  Since  there  are  r  classes  of 
objects  and  since  u^1*^  =  -  and  T.  .  =  -  T .  . ,  from  (9)  and  (10), 

*  »J  J  »  » 

for  i  t  j,  i,j  =  1,  •••  r  there  will  be  r(r  -  l)/2  distinct  hyperplanes. 
These  hyperplanes  may  now  be  used  to  identify  objects  in  the  set  S  or 
to  help  in  the  classification  of  new  objects  on  data  which  was  not  in 
the  original  training  sets. 

Assume  that  the  hyperplane  H.  .  have  been  determined  from  the 

*  >J 

training  sets.  Further  assume  that  an  unknown  object  xeS  is  the 
target  and  let 


be  a  set  of  points  characterizing  x*  The  objective  now  is  to  attempt 

-*(k) 

an  identification  of  x  as  one  of  the  a.,  e  S.  For  each  xv  e  B  a 
tentative  classification  of  that  point  into  catagory  may  be  made  on 
the  basis  of  the  inequality 

Ji,j)  Jk) 

(11)  a)  -x  >  T.  i  t  j,  j  =  1 ,  2,  •••  r. 

This  criteria  is  used  for  two  purposes.  One  is  to  avoid  any  ambiguity 
arising  due  to  noise  on  the  data  and  the  other  is  to  classify  only 
those  data  which  lie  above  and  not  on  the  test  hyperplane. 
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Any  point  satisfying  (11)  will  go  into  catagory  o.  and  any  point 
lying  on  the  hyperplane  in  question  should  properly  be  ignored.  Tin's 
may  be  done  by  defining  the  following.  Let 


(12) 


.<« .  J 

>ij  -  1 


^(i.j)  ^(k) 

i  if  a)  •  X  -  T.  .  >  0 


i 


0  otherwise 


and  let 


(13) 


1 

l 

j= 1 
j^i 


6ij 


(kl  -4-fkl 

The  metric  '  thus  defines  the  number  of  times  the  point  xv  ‘  fell 

on  the  c ^  side  of  the  hyperplanes  separating  oi  from  the  other  r  -  1 

classes  of  objects.  Clearly  if  =  r  -  1  then  x^  could,  with 

a  high  degree  of  certainty,  be  classed  as  representing  some  view  of  the 

object  a..  In  general,  however,  <  r  *  1  and,  in  fact,  p  ^  f  0, 

I  I  "  X 

i  f  i,  for  the  general  case.  If  x  =  o-  then  p.^  >  p  for  all  k  and 
i  f  a.  Since  it  is  desirable  to  class  each  x^  into  one  of  the  classes 
of  r  catagories  it  is  desirable  to  have  some  metric  which  indicates  tne 
proportionate  classifications  of  each  of  the  object  points.  Thus,  define 


f  P-j(k) 


(14) 


(k)  _  ^ 


r  .  j  ,  Pi(k)  =  Max 


j-V*-,r  3 

otherwise  . 


<p<k)> 


From  (14)  a  final  metric  representing  all  of  the  data  in  B  may  be 
defined  as 
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j \-C-t'J ^ ,>-f\ £-r. itvn-i ^ti*.’!***?.* 


I  c.(k) 

(15)  Ci  =  1^—  . 

S 

It  should  be  noted  that  T  C-  is  not  in  general  equal  to  one,  and,  thus, 

i=l  1 

cannot  be  construed  as  a  probability  of  classification.  They  are,  however, 
closely  related  to  the  probability  of  classification  with  the  main  dis¬ 
crepancy  due  to  the  vanishing  of  some  of  the  distances.  If  none  of  the 
points  fall  on  the  plane  itself  the  sum  of  C.  would,  indeed,  be  unity. 

Thus,  the  value  o*  C.  is  bounded  from  above  by  the  probability  measure, 
hence,  it  is  a  conservative  measure.  For  example,  if  C.  =  .8  it  can  be 
deduced  that  at  least  80%  of  the  points  of  set  B  fell  on  the  side  of 
the  hyperplane.  In  the  experiment  to  be. described  below,  if  the  data 
points  Aa.j  are  introduced  into  the  classifier  and  the  result  is  £.  =  .8 
the  conclusion  may  be  drawn  that  the  conditional  probability  or  the 
likelihood  function  p( C^/Aa^ )  .8;  that  is  the  probability  of  the  object 

being  classified  correctly  is  at  least  80%. 

The  experiment  carried  out  utilized  the  pairwise  separating  hyper¬ 
planes  given  in  Table  II.  However,  the  test  data  used  were  obtained  by 
interpolation  of  the  original  training  data  for  look  angles  which  were 
not  previously  measured  or  computed  (see  Appendix  A  for  the  interpolation 
method),  they  constituted,  therefore,  an  appropriate  test  set.  In 
addition,  noisy  data  were  used  to  test  the  ability  of  the  classifier  to 
operate  on  data  contaminated  by  various  amounts  of  noise  or  measurement 
errors.  Finally  a  set  of  data  representing  an  object  not  previously 
included  in  the  classification,  namely  a  cylinder,  were  tested.  The 
experimental  data  used  are  described  below  with  the  final  metric  of 


L 

i 
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equation  (15)  given  for  each  of  these  experiments  in  Table  V.  The 
notation  used  here  is  simplified  to  denote  only  the  object  class,  . 
for  which  the  metric  is  a  I'epresentativc-.  The  experiments  performed 
are  as  follows. 

1.  Interpolated  cube  data  for  <j>  =  45°,  3  =  15°,  16J,  •••  24°, 
and  4>  polarization  with  10  points  in  the  sample  set. 

2.  Interpolated  cube  data  for  <p  =  0°,  e  =  0°,  1°,  2°,  •  •  • ,  9° 
and  <f>  polarization  with  10  points  in  the  sample  set. 

3.  Interpolated  data  for  hemi -spherical  boss  for  0  =  15°,  16°, 

A 

17°,  24°  and  polarization  with  10  points  in  sample  set. 

4.  Interpolated  prolate  sphere  data  for  e  =  16°,  17°,  18°,  19° 

A 

and  <j>  polarization  with  4  points  in  sample  set. 

5.  Original  prolate  spheroid  data  with  noise  added  to  produce 
S/N  =  36  dB. 

6.  Same  as  5  except  that  the  noise  added  produced  S/N  =  19  dB. 

7.  Same  as  5  except  that  the  noise  added  produced  S/N  =  -1  dB. 

8.  Five  points  of  the  original  cube  data  for  <j>  =  0,  0  =  0,  15, 

30,  45,  and  4>  =  45 ,  0  =  15  with  noise  added  to  produce  S/N  = 

36  dB. 

9.  Sams  as  8  except  that  the  noise  added  produced  S/N  =  19  dB. 

10.  Same  as  8  except  that  the  noise  added  produced  S/N  =  -1  dB. 

11.  Original  three  sphere  data  point  with  noise  added  to  produce 
an  S/N  =  36  d3, 

12.  Same  as  11  except  that  the  noise  added  produced  an  S/N  =  19  •  . 

13.  Same  as  11  except  that  the  noise  added  produced  an  S/N  =  -1  dB. 
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TABLE  V 

EXPERIMENTAL  RECOGNITION  RESULTS 
AS  INDICATED  BY  THE  PARAMETER  ^ 


!  Test 

PS 

s 

HSB 

i 

- 

c 

•  "  1  •"  *i 

W 

I 

0 

0 

0 

i 

1.0 

0 

2 

0 

0 

0 

1 .0 

0 

3 

0 

0 

1.0 

0 

0 

4 

1.0 

0 

0 

0 

0 

5 

1-0 

0 

0 

0 

c 

6 

1.0 

0 

0 

0 

0 

7 

•7 

0 

0 

.1 

.15 

8 

0 

0 

0 

.8 

.15 

S 

0 

.2 

0 

.6 

.15 

10 

0 

0 

0 

.4 

.5 

11 

0 

1 

9 

0 

0 

12 

0 

.567 

0 

.333 

0 

13 

.333 

0 

0 

.333 

.333 

14 

.7 

0 

0 

0 

.225 

15 

i 

.1 

0 

0 

0.1 

.65 
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14.  Ten  points  representing  a  right  circular  cylinder  having  a 
diameter  of  D  =  .635  m  and  length  L  =  1  27  m  oriented  along 
the  Z  axis  for  6  =  0°,  10°,  20° ,  •••,  90°  and  e  polarization. 

15.  The  same  cylinder  as  described  in  14  but  with  <j>  polarization 
of  illuminating  signal. 

First  it  may  be  noted  that  the  interpolated  test  set  experiments 
1-4  were  classified  without  any  errors.  The  classification  of  the  prolate 
spheroid  is  highly  reliable  even  in  the  presence  of  noise.  As  experi¬ 
ments  5-7  indicate  no  errors  occurred  up  to  S/N  of  19  dB  and  even  at 
S/N  =  -  1  dB  the  probability  of  correct  classification  was  no  less  than 
70$.  The  cube  and  sphere  classification  was  less  resistant  to  noise 
(experiments  8-13),  however,  when  wrong  classifications  were  made  the 
reliability  factor  was  indicated  as  very  low,  for  example,  when  the  sphere 
was  classified  as  a  cube,  wire  or  prolate  spheroid,  it  was  indicated  that 
the  choices  carried  only  about  30%  reliability,  stressing  the  need  for 
further  data  to  obtain  a  more  reliable  decision.  Whenever  the  reliability 
indicator  exceeded  60%  the  choice  was  found  to  be  correct. 

The  last  two  experiments  are  of  particular  interest.  Consider  ex- 
peri  ment  15  first.  Here  a  right  circular  cylinder  was  ill**  ;iated  in  the 
same  way  that  the  thin  wire  (itself  a  cylinder  of  dimensions  D  =  .05  m 
and  L  =  1  m)  was  illuminated.  The  .65  in  Table  5  experiment  15  indicates 
yery  strongly  the  obvious  connection  between  the  wire  and  the  cylinder. 

In  experiment  14,  on  the  other  hand,  the  polarization  vks  along  the 
illumination  axis  or  9.  The  identification  for  this  case  resulted  in 
the  cylinder  appearing  as  a  prolate  spheroid.  There  is  a  striking 
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similarity  here.  The  ratio  of  the  length  to  diameter  of  the  cylinder, 

2,  is  the  same  as  the  ratio  of  the  major  axis  to  minor  axis  of  the 
prolate  spheroid.  Further  if  the  discontinuities  of  the  cylinder  were 
removed  a  prolate  spheroid  could  easily  be  produced.  These  facts  tend 
to  reinforce  the  opinion  that  the  spectral  response  data  tends  to 
cluster  in  12  space  in  accordance  with  the  general  size  and  shape  of 
the  target  object. 

IV.  ADAPTIVE  DETERMINATION  OF  LINEAR  SEPARATING 

HYPERPLANES 

The  solution  of  the  inequalities  (7)  and  (8)  may  be  handled  in  a 
number  cf  ways.  One  of  which  is  the  simplex  method  of  linear  programming 
which  was  used  in  the  previous  sections.  The  main  difficulty  with  this 
procedure  is  that  it  is  generally  unable  t**  handle  new  or  modified  data 
sets.  It  has  been  shown, [6]  however,  that  suitable  modifications  of 
linear  programming  algorithms  can  produce  adaptible  linear  classifiers. 
There  are,  of  course,  many  other  algorithms  applicable  to  adaptive 
linear  classification. [7, 8,S]  In  all  of  these  algorithms  the  speed  of 
convergence  and  the  amount  of  computation  is  generally  quite  large. 

Adaptive  algorithms  generally  proceed  by  estimating  a  separating 
hyperplane  and  then  making  a  new  estimate  based  on  an  observation  of 
the  system  with  the  first  estimate.  The  change,  generally,  is  dictated 
by  the  data  which  is  incorrectly  classified.  The  method  by  which 
each  new  estimation  of  separating  hyperplanes  is  made  has  a  great  deal 
to  do  with  the  rapidity  of  convergence  as,  also,  does  the  initial  hyper¬ 
plane  erlimate. 


Let  A  and  B  be  two  sets  of  n-tuples  in  Euclidean  n  space,  En. 
If  A  and  B  are  linearly  separable,  then  there  exists  a  real  vector  co 
and  a  real  number  T  such  that 


(16) 

%  •  a(i)  >  T, 

07) 

w  *  6^'  <  T, 

. 

The  problem  now  is  to  determine  the  w  and  T.  Consider  Fig.  1.  A 
reasonable  approximation  to  the  first  hyperplane  may  be  made  based  on 
the  location  of  the  center  of  gravities  of  each  set  A  and  B,  In 
particular  let 
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where  r  and  s  are  the  orders  of  sets  A  and  B  respectively.  Then  as  an 
initial  estimate  of  the  separating  vector  let 


(20) 


(o) 


and 

(21)  T  =  Min  (^o)  •  a'1*),  e  A. 

0  4 


This  hyperplane  is  shown  in  Fig.  1  and  is  labeled  HQ.  If  all  of  the 
vectors  in  B  satisfy  (17)  then  clearly  this  choice  is  an  admissible 
separating  hyperplane.  Assume,  however,  that  this  is  not  the  case. 
Let 


(1) 

B 


SB 


be  the  set  of  vectors  which  do  not  satisfy  inequality  (17).  The  problem 
becomes  one  of  moving  the  hyperplane  so  that  these  vectors  will  be 
correctly  separated.  Referring  to  Fig.  1,  if  the  point  Pg^  were 
moved  in  the  general  direction  of  the  set  of  vectors  B^  then  the 
hyperolatie  HQ,  which  is  perpendicular  to  the  line  joining  and 

would  be  "tipped"  in  the  right  way  to  bring,  hopefully,  some  of 
the  incorrectly  separated  vectors  to  the  correct  side  of  the  hyperplane. 
It  turns  out,  however,  that  under  some  conditions  this  procedure  will 
produce  a  nyperplane  which  is  a  worse  approximation  than  the  first 
estimate. 
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To  avoid  this  situation  define  the  hyperplane  H  '  as  one  parallel 

to  H  but  with  a  T  '  defined  as 
o  o 


(22)  T  •  =  Max  (w(o)  •  t(j)),  S(j)  e  B 


Thus,  for  Hq  1 ,  inequalities  (17)  are  satisfied  for  all  b^  e  B  but 
inequalities  (16)  will  not,  in  general,  be  satisfied  for  all  e  A. 
Let 


,o>.  Js'V-s'V, ...  * <V 


r  —A 


be  the  set  of  vectors  in  A  which  do  not  satisfy  inequalities  (16) 
under  hyperplane  HQ'.  Using  this  information  point  PA^  may  be  moved 
in  the  general  direction  of  the  set  and  PB^  may  be  moved  in  the 
direction  of  the  set  B^'.  Thus,  define 


A  new  hyperplane  estimate  may  now  be  made  based  on  these  quantities  as 
fol 1 ows .  Oef i ne 
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(25) 


and 


(26)  T,  =  Min(£(1)  •  2(i)). 

i 

The  resulting  hyperplane  is  shown  in  Fig.  1  as  . 

In  this  illustration  H-j  succeeded  in  separating  sets  A  and  B.  In 
general  this  may  not  happen.  What  is  required  in  this  case  is  to  repeat 
the  above  procedure.  In  general  each  estimate  may  be  defined  as  follows 


(27) 


->(q)  (q)  -*  (q) 

_  pBva/ 


and 


(28)  T  =  Min  (£(g)  •  t(i)) 

9  i 


where 
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respectively. 

This  procedure  was  applied  to  some  40  different  problems  ranging 
in  dimensionality  from  2  to  5  variables.  Among  the  problems  which  had 
the  sets  A  and  B  consisting  of  binary  n  tuples  (0's  and  Vs),  the 
procedure  converged  for  all  problems  in  from  1  to  2  iterations.*  The 
class  of  problems  in  which  the  vectors  were  arbitrary  real  numbers  con¬ 
verged  generally  in  from  3-4  iterations.  One  real  vector  problem 
required  79  iterations.  Cnnparing  these  convergence  rates  with  other 
procedures  of  a  similar  nature  (cf  references  mentioned  above)  it  was 
found  that  in  all  cases  convergence  was  from  2  to  4  times  faster  than  the 
other  methods.  Thus,  the  possibility  of  using  this  procedure  for  an 
adaptive  identification  procedure  is  certainly  feasible. 


*  An  iteration  is  defined  here  as  a  check  of  inequalities  (16)  and  (17) 
for  all  vectors  in  A  and  B. 
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V.  IDENTIFICATION  OF  TARGET  DYNAMICS 

Linear  separability  may  be  used  for  not  only  identifying  target 
objects,  as  indicated  above,  but  also  as  a  method  of  inferring  infor¬ 
mation  concerning  target  dynamics.  As  an  illustration  of  this  process 
assume  that  the  object  e  S  has  cylindrical  symmetry  cr"i  is  the  target 
object.  Assure  further  that  this  object  is  rotating  in  some  way  in  free 
space.  At  a  set  of  discrete  time  intervals  data  is  taken  which  represents 
the  impulse  response  presented  by  a ■  at  these  time  intervals.*  Let  this 
set  of  data  be  3.  Then  each  e  B  represents  the  object  as  a  function 
of  its  aspect  angle  in  time. 

Once  the  object  represented  by  the  data  in  the  set  B  has  been 
identified  a  real  valued  function  of  time,  g{t),  may  be  defined  based 
on  some  separating  hyperplane,  H. .,  which  separates  data  representative 
of  a-  from  any  other  data  set.  Let  this  hyperplane  have  a  separating 
vector  of  T.  .].  Then  g(t)  may  be  defined  as 

*  5  J 

JiJ)  + 

(33)  g(t)  =  a)  •  x(t)  -  T,  .,  x(t)  e  B. 

* 

This  function  represents  the  relative  distance  between  the  hyperplane, 

H.  and  the  point  x(t)  which,  in  turn,  is  a  function  of  aspect  angle. 

*  »J 


*  It  may  be  assured  here  that  the  object  is  effectively  motionless  during 
the  measurement  interval. 
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At  training  time,  a  function  of  4>t  the  aspect  angle,  may  be 
generated  based  on  the  data  in  the  training  set  A 0^ .  For  the  case  under 
consideration  let  this  function  be 

„;i,j)  (i  »£)  _^(i  »a) 

(34)  -  a)  •  a  {^)  -  a  e  Aai 

f(^)  may  reasonably  be  assumed  to  be  a  well  behaved,  continuous  function 
of  aspect  angle.  The  aspect  angle,  ij>,  is  related  to  the  spherical 
coordinates,  8  and  <f>  as  shown  in  Fig.  2.  From  this  figure 


2 


Fig.  2.  Aspect  angle  of  rotational  axis  of  a 
cylindrical  symmetrical  object. 
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(35)  i|»  =  cos”1  (sine  cos<j>). 

In  general  both  e  and  <j)  are  functions  of  time.  Thus,  if  g(t)  can  be 
related  to  f(<j>)  at  each  instant  of  time  then  41(c)  may  be  found  and  from 

(35)  e(t)  and  $(t)  may  be  inferred.  This  inference  is,  of  course,  not 
without  ambiguity  in  the  general  case. 

Consider  as  an  example  the  following  case.  Let  the  object  be 
rotating  in  space  such  that  e  =  w0t  and  <f>  =  <t>Q.  Thus  is  rotating 
in  a  plane  making  an  angle  <j>Q  with  the  direction  of  illumination.  Thus, 
i|i(t)  becomes 

(36)  4)(t)  =  cos-1  (sin(a>Qt  +  0)  cos  «j>Q) , 

where  g  is  a  phase  angle  produced  by  the  time  at  which  observations  are 
begun. 

Assume  that  during  the  observational  period  a..  goes  through  at 
least  one  full  revolution.  Thus,  the  axis  of  symmetry  must  become  co- 
1 inear  with  the  z  axis  at  least  once.  If  this  instant  of  time  can  be 
identified  uniquely  then  a>0  is  found  from  the  period  of  revolution  and 
g  becomes  e  =  -  u>Q  tQ  where  t  is  the  time  at  which  4>(t0)  =  90°.  With 
this  information  at  hand  the  value  of  <j>Q  becomes  simply 

(37)  *  =  Min  (+(t)). 

0  t 

Clearly  this  class  of  target  dynamics  is  easily  recognized.  Other 
types  of  motion  become  more  difficult  to  identify.  This  identification 
may,  however,  be  made  based  on  the  training  function  f(4>)  if  a  one  to 
one  correspondence  between  f(  )  and  g(t)  can  be  found. 
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VI.  NON  LINEAR  DECISION  SURFACES 


The  application  of  separating  hyperplanes  to  classify  n-dimensional 
data,  as  was  done  above,  may  pose  problems  when  the  number  of  alternative 
objects  is  rather  large.  One,  in  particular,  is  that  a  single  object 
classifier  is  not  possible  due  to  the  necessity  of  having  at  least  two 
sets  of  data  to  define  a  separating  hyperplane.  One  data  set  is,  of 
course,  the  set  representing  the  target  of  interest.  The  second  data  set 
would  thus,  become  that  set  representing  all  target  objects  which  are  not 
the  chosen  one  -  a  most  formidable  array  of  data  if  the  number  of  object 
classes  is  large. 

A  second  difficulty  has  to  do  with  updating  the  number  of  target 
classes.  If  a  new  target  class  is  to  be  added  to  the  set  of  k  original 
classes  then  the  addition  will  require  determination  of  k  new  hyperplanes. 
Furthermore,  this  requires  that  the  original  data  sets  used  for  training 
be  available  for  the  addition  of  the  new  class. 

One  way  of  avoiding  these  difficulties  is  to  use  closed  separ¬ 
ating  hypersurface  such  as  "hyperboxes"  and  "hyperellipsoids."  A 
hyperbox  may  be  defined  in  n  space,  as  a  closed  surface  bounded  by  2n 
hyperplanes  such  that  each  hyperplane  is  parallel  to  exactly  one  other 
hyperplane  and  perpendicular  to  the  remaining  2n-2  hyperplanes.  One 
particularly  easy  hyperbox  to  specity  is  one  having  its  faces  perpen¬ 
dicular  to  the  coordinate  axis.  The  position  of  each  fac-  along  the 
axis  is  then  determined  by  the  maximum  and  minimum  value  of  the 
corresponding  component  over  the  set  of  data.  In  particular,  let 

=  {a-j,  a2,  a$>  , 
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where 


ai  =  ^ai  1  *  ai2’  "*»  ain^ 


be  a  set  of  data  representing  a  particular  target  object.  The  hyper¬ 
box  just  described  may  now  be  specified  by  the  two  n-tuples 


where 


“M  =  (a)Ml  ’  W 


m  v  ml  ’  mn' 


0Mi  s  Ma*  (aii) 

^  i=l  ,•  •  •  ,S  1J 


Min  (a..). 
i=l »• • • ,S  1J 


Thus,  a  point  x  will  be  in  the  hyperbox  if  and  only  if 


U)„.  <  x  <  0)M 
m  —  --  M 


and  thus,  associated  with  the  object  whose  training  set  is  A.  Al¬ 
though  such  an  identification  procedure  is  quite  sim>  it  has  a  number 

of  drawbacks.  Not  least  of  these  is  the  very  large  volume  of  the  hvper- 
box  which  does  not  contain  any  data  points  and  thus,  serves  no  useful 
classification  purpose  and,  in  fact,  may  contribute  to  erroneous 
classification. 
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To  avoid  such  problems,  it  would  appear  that  the  volume  of  the 
enclosing  hyperbox  should  be  minimized  in  some  sense.  One  way  of 
specifying  such  a  minimal  hyperbox  is  to  orient  the  faces  a  e  'g  the 
axis  o-f  point  "spread."  This  may  be  done  in  the  following  way.  Let 
A  br-  the  set  of  training  data  and  assume  S,  the  number  of  points  in  A, 
to  be  not  lass  than  n,  the  space  dimension.  The  first  step  in 
specifying  the  "minimal"  hypernox  is  to  determine  a  hyperplane,  H, , 
which  passes  through  the  set  of  points  in  A  and  which  is  oriented  along 
the  axis  of  distribution  of  the  points.  Let  this  hyperplane  be  denoted 
by  the  weight  vector  (hyperplane  normal)  %  =  (“-n .  o>.  )  and  the 

real  number  T.  More  precisely  this  hyperplane  is  the  one  which  minimizes 

(38)  z-j  =  l  o-j. 

where 

(33)  5li  =  ”1  *  H  ~  TV 

For  z-,  to  be  minimized  it  is  necessary  that 

(4U'  °»  j  =  *’  2’  ***»  n 

lo 

and 

az, 

<41> 

From  (38)  and  (39) 
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ywo***.  > 


•**&&»****  ’xw5S7R^ajas^B^^^wj?«5q>iS@S»®^@ge0^WS« 


(42) 


and 


3Z 


j^J"0’2^  aij  h  -  T1 ) •  J  “ 1  *  ••••" 


1 


(43) 


3Z 

3T 


-  =  0  »  -  2  l  (£.  •  a.  -  TO. 
1  i=l  1  1  1 


From  (43) 


(44) 


ST,  = 


-f  r>  -> 

“*  ’  ili  a* 


or 

(45)  T1  =  “l  *  P 


where  p  =  i 


S 

<£. 


is  the  first  moment  of  the  set  A. 


The  set  of  equations  (42)  form  a  homogeneous  system  of  n  equations 
in  n  unknowns  since  Tf  is  dependent  on  w-j  by  (45).  In  order  to  determine 
a  nontrivial  solution,  a  constraint  needs  to  be  added  to  the  system  (42). 
One  reasonable  constraint  is  to  assume  that  the  sum  of  the  weights  and 
threshold  is  not  zero  or 


(46)  5,  •  I  4  T1  =  k  f  0 


where 

I  =  0,  1,  •**,  1). 

Thus,  T-j  =  k  -  u-j  I  and  (42)  becomes 
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After  a  solution  to  (48)  is  obtained  the  second  hyperplane,  Hg, 
denoted  by  its  weight  vector,  u>2,  and  real  number  T2>  may  be  found  which 


mini mi zes 


l  ^ 


subject  to  the  constraints  that 


(52)  «i  •  w2  ~  0 


where 


62i  =  u2  ‘  ai  '  T2* 


The  constraint  given  in  (52)  may  be  introduced  into  the  problem  by 
introducing  the  Lagrange  multiplier,  )  The  problem  thus  becomes  one 
of  minimizing 


j  2  y  y 

Zz  =  62i  +  X1  u2  *  “l 


This  minimization  requires  that 


=  0,  j  =  1 ,  2,  •  • • ,  n 


8Z2  8Z2 
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~  “2  J1  aij  (ai  +  I)  -  2  k  eij  +  xi  ui j 
Observe  that  from  (58) 

(61)  T2  =  •  p 

as  before.  Eqs.  (59)  and  (60)  give  the  problem  in  matrix  form  as 
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(62) 


r  n 

—  — 

—  — 

c r 

_ i 

C11  C12  *  *  •  Cln  “11 

“21 

b2 

C21  C22  *  ‘  *  C2n  “12 
•  •  • 

“22 

• 

• 

= 

•  •  • 

•  •  • 

• 

bn 

Cnl  Cn2  *  *  ‘  Cnn  “in 

“2n 

0 

“11  “12  *  *  *  “in  0 

X1 

_  __ 

~  — . 

— 

where 


(63) 


V 


S 

2  k  y 
1-1 


a.  • 
ij 


and 


(64) 


ai j  <an  +  ')• 


Continuing  in  this  nanner  to  step  £  gives 


bi 

cn 

C12 

*  '  *  Cln 

“11 

“21  * 

•  •  Vl,l 

b2 

• 

* 

C21 

C22 

.  .  .  c2n 

“12 

o>22  • 

• 

• 

*  *  Vl  *2 
• 

• 

(65) 

b 

n 

cnl 

Cn2 

’  •  *  Cnn 

“In 

• 

“2n  * 

• 

*  ‘  Vl,n 

0 

“11 

“12 

*  *  *  “In 

0 

0  . 

.  .  0 

0 

• 

• 

“21 

“22 

•  •  •  “2n 

0 

0  . 

.  .  o 

0 

“£-1,1 

Vl  ,2 

•  •  ’  Vl 

,n  0 

0  . 

.  .  0  J 

“a! 

“£2 
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where  z  =  2,  3,  •  •  • ,  n. 


S 

(66)  b.  =  2  k  l  a.. 

j  i=1 

and 

(67)  cjk =  2  j,  au  (aik + 11  • 

When  z  -  1  the  first  hyperplane  may  be  found  from  Eq.  (48).  Further¬ 
more,  since  k  is  arbitrary  and  nonzero  it  may,  without  loss  of  generality, 
be  set  equal  to  one.  Note  further  that 

(68)  Tj  =  J  j  *  p,  j  =  1,  2,  n. 

By  solving  the  n  problems  just  described,  the  face  hyperplanes 
may  be  found.  The  actual  face  positions  may  be  found  by  locating  two 
hyperplanes  parallel  to  each  H.  (their  weight  vectors  will  be  u.)  such 
that  all  of  the  points  in  A  lie  between  these  hyperplanes.  The  hyperbox 
faces  are  then  characterized  by  the  structure  [m. ;  T^u,  T-^],  i  =  1 , 

2 ,  •  •  • ,  n ,  where 

(69)  T.  =  Max  (S.  •  a,) 


(70)  T-  =  Min  («,  •  a,) 

U  j=l»-**,S  1  3 
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One  measure  of  the  volume  of  the  hyperbox  and  thus,  the  cluster¬ 
ing  of  the  data  is  the  distance  between  parallel  faces.  This  distance, 
d^ ,  is  given  by  the  equation 


(71) 


The  hyperboxes  and  corresponding  face  distances  were  computed  for  the 
data  representing  the  prolate  spheroid,  the  cylinder,  the  cube,  and  the 
hemispherical  boss.  This  information  is  shown  in  Tables  VI,  VII,  VIII, 
and  IX  respectively.  In  these  tables  T1  and  T2  correspond  to  T.  and  T.. 
of  Eqs.  (69)  and  (70)  respectively.  T3  represents  •  p  and  indicates 
where  the  box  center  is  with  respect  to  the  center  of  gravity  or  first 
moment.  If  the  box  center  and  the  center  of  gravity  overlap  then 
T3  =  (T1  +  T2)/2.  D  represents  the  face  distances  as  computed  from 
Eq.  (71). 


Hyperellipse 

A  second  type  of  closed  hypersurface  useful  in  the  classification 
of  target  data  is  the  hyperellipse.  If  such  an  ellipse  has  its  center 
at  the  origin  and  its  axis  of  symmetry  aligned  with  the  coordinate 
system  then  the  ellipse  is  defined  by 

(72)  df  x,2  +  d2  x22  +  •••  +  dn  x„2  =  1  . 

Generally  the  data  set.  A,  will  not  center  on  the  coordinate  origin 
nor  will  the  point  distributions,  forming  the  axis  of  synmetry,  align 
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with  the  coordinate  axis.  Under  these  circumstances  tne  ellipse  will 
be  described  by 


(73)  (x  -  t)  P  (J  -  f)T  =  1, 


where  g  is  the  ellipse  center  and  P  is  the  matrix  describing  the  ellipse 
orientation.  The  problem  now  becomes  one  of  estimating  the  paraineters 
£  and  P  for  an  ellipse  which  completely  encloses  the  data  set. 

One  easily  obtainable  estimation  of  these  parameters  may  be  made 
on  the  basis  of  the  location  *nd  orientation  of  the  hyperboxes  described 
above.  The  center  of  the  ellipse  may  be  aligned  with  the  center  of  the 
hyperbox  or  with  the  center  of  gravity  of  the  data  set.  Call  this 
center  %.  Then  a  new  coordinate  system  may  be  found  by  moving  the  old 
coordinate  system  origin  to  the  t  thus  forming  the  data  set 

(74)  A'  =  {y,  •••,  as'} 
where 

(75)  V«Vt 

The  orientation  of  the  ellipse  may  reasonably  be  chooser  to 
coincide  with  the  hyperbox  described  above.  Letting  u-| ,  "!>  t: 

the  ordered  ret  of  hyperplanes  found  for  the  hvperbox,  the  unit  vectors 
describing  a  basis  for  a  rotated  coordinate  system  aligned  with  these 
vectors  become 
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(76) 


=  »./ 
V 


,  i  =1.2, 


,  n. 


Thus,  the  data  set  in  this  rotated  space  becomes 


where 

(77)  (a,‘)T  =  ")T 

for 


(78)  W  = 


Thus,  since  W  is  nonsingular 


(79)  (I,-)1  -  W1  (J,1)1- 


Thus,  an  elliose  circumscribing  the  data  set  A"  has  a  defining  equation 
of  the  form  of  Eq.  (72). 

For  an  ellipse  to  circumscribe  this  data  it  is  necessary  that 
(80)  d  •  V.  -  1  <  0,  i  -  1,  2,  •••,  S 


where 
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^  =  »  ^2’ 


and 


(81 


-y  lip  lip  lip 

Vi  =  ^ai  1  »  ai2’  ”■»  ain^ 


The  determination  of  d  from  inequalities  (80)  will  thus  specify 
the  ellipse. 

This  problem  may  now  be  stated  as  a  linear  programing  problem 
by  modifying  (80)  to 

(82)  d  •  V-  +  $.  -  1 ,  i  =1,2,  • • • ,  S 
subject  to  the  constraints  that 

(83)  dj  >  0,  j  =  1,  2,  n 
and 

(84)  S-  >  0,  i  -•  1,  2,  •••,  S. 

Since  the  smallest  ellipse  is  required  an  appropriate  objective  function 
is  the  minimization  of 


S 

(85)  z=js.. 

Upon  solving  this  linear  p  ograrmring  problem  the  coordinate  system 
may  be  transferred  back  to  the  original  system  as  follows.  Let. 
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Then  the  equation  of  the  ellipse  becomes 


(87)  1  =  x"  D  (x")T 
but 

(88)  (x")T  =  tf1(x')T 

-  W'(S  -  l)T 

or 

(89)  1  -  (x  -  |)  (W1)7  Dlf'lx  -  UT. 


Thus,  the  enclosing  ellipse  for  each  object  class  is  specified  by  the 
set  of  parameters  r,,  W’  ^ ,  and  D. 
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CONCLUSIONS 

A  study  of  the  identification  of  targets  using  their  frequency 
responses  has  been  carried  out.  The  objects  considered  for  classification 
were  represented  by  a  set  of  points  in  a  twelve  dimensional  Euclidean 
space  correspono’ng  to  the  twelve  test  frequencies.  Only  one  object 
was  completely  s>mmetrical  and  thus  represented  by  a  single  point.  Some 
of  the  other  objects  considered,  e.g.,  a  spheroid,  cube,  hemispherical 
boss,  provided  a  different  return  depending  on  the  aspect  angle.  Thus 
each  object  was  usually  represented  by  a  set  of  points,  scattered  over 
a  portion  of  the  twelve-space.  The  first  and  most  important  fact  to 
ascertain  was  whether  the  classes  were  separable.  The  investigation 
revealed  that  indeed  all  classes  considered  were  separable,  moreover 
each  class  was  linearly  separable  from  all  other  classes. 

A  very  interesting  and  useful  fact  was  discovered  in  the  process 
of  constructing  the  hyperplanes  separating  the  various  classes.  It  was 
found  that  the  average  numoer  of  non  zero  components  of  the  separating 
vector  was  three.  It  thus  appears  that  the  dimensionality  of  the 
feature  extraction  process  may  be  substantially  reduced  from  the  twelve 
that  was  originally  postulated.  The  implications  of  this  discovery 
with  respect  to  the  reduction  in  the  equipment  complexity  required  for 
system  implementation  are  quite  obvious. 

In  order  to  test  the  reliability  of  detection  based  on  the 
separating  hyperplanes,  substantial  amount  of  additional  data  were 
obtained  covering  new  aspect  angles  for  the  various  objects  considered. 
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These  data  were  used  as  test  data  as  compared  to  the  first  data  which 
constituted  the  training  sets.  The  results  of  the  test  showed  that  all 
test  data  used  were  correctly  identified. 

A  study  was  also  carried  out  into  the  effects  of  noise  on  the 
separability  of  the  classes  considered.  The  effects  of  noise  on  both 
training  sets  and  test  sets  were  studied.  The  results  indicate  that 
when  the  training  sets  were  noise  free  the  system  would  correctly 
identify  noisy  test  sets  up  to  noise  levels  of  the  order  of  10  percent 
of  the  signal.  With  regard  to  noisy  training  sets,  it  was  shown  that 
class  separability  was  possible  up  to  noise  levels  of  50X  of  signal, 
indicating  that  an  adaptive  classifier  may  b<  very  beneficial  for 
situations  where  the  training  samples  are  either  noisy  or  the  population 
incompletely  known.  One  such  adaptive  procedure  has  been  developed  and 
found  to  converge  quite  rapidly,  in  most  cases  four  iterations  or  less 
were  adequate. 

An  automatic  target  classifier  has  been  developed  and  successfully 
nested  on  various  objects.  The  target  responses  used  were  contaminated 
by  noise  to  varying  degrees.  The  results  have  indicated  that  a  high 
probability  of  correct  classification  could  be  obtained  up  to  moderate 
amounts  of  noise  and  msasurement  errors. 

A  technique  has  been  devised  to  determine  target  dynamics,  such 
as  the  rotation  axis  of  a  satellite,  based  on  its  vector  trajectory 
in  n  space. 

A  study  has  been  carried  out  of  the  use  of  nonlint  separating 
hyoersurfaces  for  classification  of  objects  that  may  not  be  linearly 
separable.  Two  types  of  closed  surfaces  have  been  used,  hyperboxes  and 
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hyperellipsoids.  The  use  of  these  surfaces  led  to  an  assessment  of 
the  clustering  properties  of  the  various  objects  studied.  It  was 
found  that  the  object  classes  examined  were,  tightly  packed  and  occupied 
extremely  small  volumes. 

Further  work  in  assessing  the  resolvability  of  new  objects  and 
the  distribution  of  their  representative  point  classes  is  continuing. 


APPENDIX  A 


INTERPOLATION  ALGORITHM 

The  interpolation  of  the  backscatter  data  requires  interpolating 
twelve  functions  of  aspect  angle  one  for  each  of  the  illuminating 
frequencies.  There  are,  of  course,  many  procedures  which  may  be 
used  to  accomplish  such  interpolation.  In  virtually  all  cases,  a  very 
large  system  of  equations  must  be  solved  in  order  to  determine  some 
approximating  function  or  spline  fit.  Furthermore,  each  system  of 
data  requiring  interpolation  has  a  variable  range  of  aspect  angles  to 
be  considered.  Because  of  these  things  a  local  interpolation  pro¬ 
cedure  was  desired. 

Consider  the  situation  shown  in  Fig.  A1 .  The  function  f(x)  has 
been  specified  by  its  values  at  x  =  xQ,  x^ ,  •*•,  xfi.  A  particularly 
straight  forward  wjy  of  interpolating  f(x)  is  to  connect  adjacent 
points,  say  f(x.)  c  id  f (x -+-j ) ,  by  sections  of  one  or  more  quadratics. 


Fig.  A1 .  Int'  'polation  schema. 
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In  particular  consider  the  values  of  the  function  at  the  points 

x.j,  and  x.j+i .  Call  the  functional  value  y^,  yi  and  y^-j  respectively. 

These  three  points  may  then  be  used  to  describe  a  quadratic  of  form 


(Al) 


y  =  ail  x 


+  ai2  x  +  ai3 


which  is  defined  by 


(A2) 


yi-l 

Xi-1 

1 

ail 

yi 

= 

xi2 

xi+1 

xi 

1 

ai2 

yi+l 

xi+l 

1 

ai3 

for  i  =  1,  2,  n-1.  A  solution  to  { A2)  will  produce  a  quadratic 

approximation  of  f{x)  given  by  (Al)  for  the  range  £  x  £  xi+1 
Because  a  solution  to  (A2)  for  all  i  in  the  specified  range  is  straight 
forward  a  better  approximation  may  be  made  as  follows: 

I 

(A3)  y  -  x2  +  a12  x  *  >13.  xQ  <  x  <  x  , 


(A4)  y  -  ttlAlhA  x2  +  ♦  lidAllhl  , 


xi_1  i*  lxj>  i  =  2,  3,  n  -  1 


and 


(AS)  y  =  Vl,l  x  +  an-1,2x  +  an-l,3’  Vl  ± x ± V 
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This  approximation,  effectively,  is  ar  average  of  two  quadratics  for 
all  x  outside  of  the  ranges  from  xQ  to  x,  and  xn_-|  to  xR.  Since  t.he 
data  used  here  is  well  behaved  in  the  sense  that  the  first  and  second 
derivatives  are  small,  the  lack  of  slope  matching  at  the  approximation 
boundaries  should  not  appreciably  affect  the  interpolation. 

The  appropriate  coefficients  required  by  (A3),  (.'4)s  ar.u  (A5)  may 
be  found  from  Eq.  (A2)  after  making  some  appropriate  assumptions.  Since 
the  aspect  angles  for  the  data  were  taken  in  equal  steps,  assume  that 
xi  =  x0  +  1,5  w^,ere  6  is  the  spacing  increment.  Finally,  without,  loss 
of  generality,  it  may  be  assumed  that  xQ  =  0.  Thus,  solving  (A2)  yields 


*i-i  -  2*i  +  y<+i 


(A6)  a,,  =  - ? 

26  ^ 


(A7) 


ai2  = 


' 2i  +  1)  y-j,]  +  4iyi  -  (2i  -  1)  yi+j 


26 


(A8) 


.2 


ai  3  = 


i(i  +  1)  y1_1  -  2(i  -  1)  yi  +  i(i  -  1)  yi+. 


for  i  =  1 ,  2,  • • • ,  n  -1 . 
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Tn  order  to  obtain  an  exact  solution  to  the  inverse  scattering  problem, 
t-h«»  target  response  is  required  over  a  continuous  band  of  frequencies  and 
aspect  angles.  An  alternative  approach  to  the  problem  which  requires  a  much  more 
modest  amount  of  measurement  data  but  assumes  substantial  a  priori  information  is 
as  follows:  A  finite  set  of  alternatives  is  specified  regarding  target  shape 
and  composition  and  a  set  of  measurements  is  carried  out  to  provide  the  answer 
as  to  which  one  of  the  alternatives  holds.  Since  it  is  necessary  to  restrict 
the  measurements  to  a  relatively  small  range  of  frequencies  the  question  arises 
as  to  what  frequency  band  is  best  to  characterize  the  target  and  to  provide  the 
most  reliable  discrimination  from  other  targets.  The  best  apparent  choice  is 
the  highest  frequency  possible  since  for  a  given  percent  bandwidth  it  would 
contribute  the  greatest  amount  of  information.  However,  an  examination  of  the 
Fourier  transform  of  the  impulse  response  ludiCutcd  that  the  frequency  tau^c 
corresponding  to  wavelengths  starting  with  the  size  of  the  object  and  increasing 
to  ten  tiroes  its  dimension  would  provide  the  most  useful  initial  information. 

It  is  low  frequency  range  which  is  used  in  this  paper; to  characterize  the  various 
objects  of  interest,  and  provides  the  information  used  to  identify  the  objects. 
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